ISO/IEC JTC 1/5С 2/WG3N 475 


Date : 1999-01-15 


ISO/IEC JTC 1/SC 2/WG 3 
7-bit and 8-bit codes and their extension 


SECRETARIAT : ELOT 


DOC TYPE : Proposed NP (proposed under review by WG or SC) 


TITLE : Proposal for a New Part of ISO/IEC 8859, -8bit single-byte 
coded graphic character sets -- Part 16: Latin No. 10 


SOURCE : National Body of Romania 

PROJECT: 02.20.10 

STATUS : For consideration at the 9" Plenary Meeting of SC 2, 
Fukuoka, Japan, 1998-03-16/17 

ACTION ID: ACT 

DUE DATE: 


Р, О and L Members of ISO/IEC JTC 1/SC 2 
DISTRIBUTION : WG Conveners, Secretariats 

WG 3 Members 

ISO/IEC JTC 1 Secretariat 


ISO/IEC ITTF 
MEDIUM: Open 
NO OF PAGES: 13 


Contact 1: Secretariat ISO/IEC JTC 1/5С 2/WG 3 ELOT Mrs K.Velli (acting) 
Acharnon 313, 111 45 Kato Patissia, ATHENS – GREECE 
Tel: +30 1 21 20 307 Fax : +30 1 22 86219 E-mail : kkb@elot.gr 


Contact 2 : Convenor ISO/IEC JTC 1/SC 2/WG 3 Mr E.Melagrakis 
Acharnon 313, 111 45 Kato Patissia, ATHENS – GREECE 
Tel: +30 1 21 20 301 Fax : +30 1 22 86219 E-mail: eem@elot.gr 


ISO/IEC JTC 1/SC 2/WG3N 475 


Information technology —8-bit single-byte coded graphic 
character sets 
Part 16: Latin alphabet No. 10 


1 Scope 


This part of ISO/IEC 8859 specifies a set of 191 coded graphic characters identified as Latin 
alphabet No. 10. 


This set of coded graphic characters is intended for use in data and text processing applications and 
also for information interchange. 


The set contains graphic characters used for general purpose applications in typical office environments 
in at least the following languages: 


Albanian, Croatian, English, French, German, Hungarian, Italian, Polish, Romanian, Slovenian. 


This set of coded graphic characters may be regarded as a version of an 8-bit code according to 
ISO/IEC 2022 or ISO/IEC 4873 at level 1. 


This part of ISO/IEC 8859 may not be used in conjunction with any other parts of ISO/IEC 8859. If 
coded characters from more than one part are to be used together, by means of code extension 
techniques, the equivalent coded character sets from ISO/IEC 10367 should be used instead within a 
version of ISO/IEC 4873 at level 2 or level 3. 


The coded characters in this set may be used in conjunction with coded control functions selected from 
ISO/IEC 6429. However, control functions are not used to create composite graphic symbols from two 
or more graphic characters (see clause 6). 

NOTE - ISO/IEC 8859 is not intended for use with Telematic services defined by ITU-T. If information coded 


according to ISO/IEC 8859 is to be transferred to such services, it will have to conform to the requirements of 
those services at the access-point. 


2 Conformance 


2.1 Conformance of information interchange 

A coded-character-data-element (CC-data-element) within coded information for interchange is in 
conformance with this part of ISO/IEC 8859 if all the coded representations of graphic characters 
within that CC-data-element conform to the requirements of clause 6. 

2.2 Conformance of devices 

A device is in conformance with this part of ISO/IEC 8859 if it conforms to the requirements of 

2.2.1, and either or both of 2.2.2 and 2.2.3. A claim of conformance shall identify the document which 


contains the description specified in 2.2.1. 


2.2.1 Device description 


ISO/IEC JTC 1/SC 2/WG3N 475 


A device that conforms to this part of ISO/TEC 8859 shall be the subject of a description that identifies 
the means by which the user may supply characters to the device, or may recognize them when they are 
made available to him, as specified respectively in 2.2.2 and 2.2.3. 


2.2.2 Originating devices 

An originating device shall allow its user to supply any sequence of characters from those specified in 
clause 6, and shall be capable of transmitting their coded representations within a CC-data-element. 
2.2.3 Receiving devices 

A receiving device shall be capable of receiving and interpreting any coded representations of 
characters that are within a CC-data-element, and that conform to clause 6, and shall make the 


corresponding characters available to its user in such a way that the user can identify them from among 
those specified there, and can distinguish them from each other. 


3 Normative references 


The following standards contain provisions which, through reference in this text, constitute provisions 
of this part of ISO/IEC 8859. At ће time of publication, the editions indicated were valid. All standards 
are subject to revision, and parties to agreements based on this part of ISO/IEC 8859 are encouraged to 
investigate the possibility of applying the most recent editions of the standards listed below. Members 
of IEC and ISO maintain registers of currently valid International Standards. 


ISO/IEC 2022:1994, Information technology — Character code structure and extension techniques. 


ISO/IEC 4873:1991, Information technology - ISO 8-bit code for information interchange — Structure 
and rules for implementation. 


ISO/IEC 8824-1:1995, Information technology — Abstract Syntax Notation One (ASN.1): Specification 
of basic notation. 


4 Definitions 


For the purposes of this part of ISO/TEC 8859 the following definitions apply: 
4.1 bit combination: An ordered set of bits used for the representation of characters. 
4.2 byte: A bit string that is operated upon as a unit. 


4.3 character: A member of a set of elements used for the organization, control, or representation of 
data. 


4.4 code table: A table showing the characters allocated to each bit combination in a code. 


4.5 coded character set; code: A set of unambiguous rules that establishes a character set and the 
one-to-one relationship between the characters of the set and their bit combinations. 


4.6 coded-character-data-element (CC-data-element): An element of interchanged information 
that is specified to consist of a sequence of coded representations of characters, in accordance with one 
or more identified standards for coded character sets. 


4.7 graphic character: A character, other than a control function, that has a visual representation 
normally handwritten, printed or displayed, and that has a coded representation consisting of one or 
more bit combinations. 


ISO/IEC JTC 1/5С 27 С 3 N 475 
NOTE – In ISO/IEC 8859 a single bit combination is used to represent each character. 
4.8 graphic symbol: A visual representation of a graphic character or of a control function. 
4.9 position: That part of a code table identified by its column and row coordinates. 


5 Notation, code table and names 


5.1 Notation 


The bits of the bit combinations of the 8-bit code are identified by Ба, b7 , bo, 65, b4 , b3 , 62, and bi, 
where bs is the highest-order, or most-significant bit and bi is the lowest-order, or least-significant bit. 


The bit combinations may be interpreted to represent numbers in binary notation by attributing the 
following weights to the individual bits: 


Bit bs b7 be bs b4 b3 b2 ы 
Weight 128 64 32 16 8 4 2 1 


Using these weights, the bit combinations are identified by notations of the form xx/yy, where xx апа 
yy are numbers in the range 00 to 15. The correspondence between the notations of the form xx/yy and 
the bit combinations consisting of the bits bs to bi is as follows: 


e xx is the number represented by Ы, b7 , бе and bs where these bits are given the weights 8, 4, 2, 
and | respectively. 


e уу is the number represented Бу ба, 63, b2 and bı where these bits are given the weights 8, 4, 2, 
and | respectively. 


The bit combinations are also identified by notations of the form hk, where h and k are numbers in the 
range 0 to F in hexadecimal notation. The number h is the same as the number xx described above, and 
the number k is the same as the number yydescribed above. 


5.2 Layout of the code table 


An 8-bit code table consists of 256 positions arranged in 16 columns and 16 rows. The columns and the 
rows are numbered 00 to 15. In hexadecimal notation the columns and the rows are numbered 0 to F. 


The code table positions are identified by notations of the form xx/yy, where xx is the column number 
and yy is the row number. The column and row numbers are shown at the top and left edges of the table 
respectively. The code table positions are also identified by notations of the form hk, where h is the 
column number and k is the row number in hexadecimal notation. The column and row numbers are 
shown at the bottom and right edges of the table respectively. 


The positions of the code table are in one-to-one correspondence with the bit combinations of the 
code. The notation of a code table position, of the form xx/yy, or of the form hk, is the same as that of 
the corresponding bit combination. 


5.3 Names and meanings 


This part of ISO/IEC 8859 assigns a unique name and a unique identifier to each graphic character. 


These names and identifiers have been taken from ISO/IEC 10646-1 (E). This part of ISO/IEC 8859 
also specifies an acronym for each of the characters SPACE, NO-BREAK SPACE and SOFT 
HYPHEN. For acronyms only Latin capital letters A to Z are used. It is intended that the acronyms be 
retained in all translations of the text. 


Except for SPACE (SP), NO-BREAK SPACE (NBSP) and SOFT HYPHEN (SHY), this part of 
ISO/IEC 8859 does not define and does not restrict the meanings of graphic characters. 
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This part of ISO/IEC 8859 specifies a graphic symbol for each graphic character. This symbol is shown 
in the corresponding position of the code table. However, this part, or any other part, of ISO/IEC 8859 
does not specify a particular style or font design for imaging graphic characters. Annex B of ISO/IEC 
10367 gives further information on this subject. 


5.3.1 SPACE (SP) 
A graphic character the visual representation of which consists of the absence of a graphic symbol. 


5.3.2 NO-BREAK SPACE (NBSP) 
A graphic character the visual representation of which consists of the absence of a graphic symbol, for 
use when a line break is to be prevented in the text as presented. 


5.3.3 SOFT HYPHEN (SHY) 
A graphic character that is imaged by a graphic symbol identical with, or similar to, that representing 
HYPHEN, for use when a line break has been established within a word. 


6 Specification of the coded character set 


This part of ISO/IEC 8859 specifies 191 characters allocated to the bit combinations of the code table 
(table 2). None of these characters are combining characters. 


NOTE – Combining characters are described in ISO/IEC 2022:1994 subclause 6.3.3. 


Control functions, such as BACKSPACE or CARRIAGE RETURN, shall not be used to create 
composite graphic symbols, which are made up from the graphic representations of two or more 
characters. 


6.1 Characters of the set and their coded representation 
See table 1. 
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Table 1 — Character set, coded representation 
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Table 1 (continuated) 
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| о | 
|%05 | 
| 06/03 | 
| 004 | 
| oss 
| 06/06 | 
| 07 | 
|908 | 
| 06/09 | 
[бю | 
foot | 
|612 | 
065 | 
|4 | 
[osas | 
[oroo | 
[оо | 
[ою | 
|005 | 
јо | 
(5 | 
|06 | 
[ою 
[oros | 
[oro |9 |009 | 
| отло | 
ЕЛЕН 
|012 | 
[omnis | 
| тла _____| 
Ете 
[1000 | 
| 100 _____ 
(оо | 
|15 | 
јо | 
|185: | 
| 1006 | 
| 1007 | 
| 1008 | 
| 10/09 || 
(оо | 
| поп _____| 
| 102 ____| 
pions | 
| 104 | 
| 1055 ____| 
| по | 
[ioi | 
| 12 | 
(чо | 
ја | 
| 105 | 
| 16 | 


ISO/IEC JTC 1/SC 2/WG3N 475 


Table 1 (continued) 
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LATIN SMALL LETTER O WITH ACUTE 
LATIN SMALL LETTER U WITH GRAVE 
LATIN SMALL LETTER U WITH ACUTE 


сісісісісісіс 


сісісісіс 


С 


6.2 Code table 


For each character in the set the code table (table 2) shows a graphic symbol at the position in the code 
table corresponding to the bit combination specified in table 1. 


The shaded positions in the code table correspond to bit combinations that do not represent graphic 
characters. Their use is outside the scope of ISO/IEC 8859; it is specified in other International 
Standards, for example ISO/IEC 6429. 
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Table 2 - Code table of Latin alphabet No. 10 
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7 Identification of the character set 


7.1 Identification according to ISO/IEC 2022 and ISO/IEC 4873 

The graphic characters of this part of ISO/IEC 8859 constitute a single coded character set. However in 
accordance with ISO/IEC 2022 and ISO/IEC 4873 the code table of this part of ISO/IEC 8859 may Бе 
considered to consist of the following components: 

e Тһе character SPACE represented by bit combination 02/00; 

e a94-character GO graphic character set represented by bit combinations 02/01 to 07/14; 


e a96-character СІ graphic character set represented by bit combinations 10/00 to 15/15. 


When the identification methods of ISO/IEC 2022 or ISO/IEC 4873 are used this part of ISO/IEC 8859 
shall be identified by the following pair of designation functions: 


GZD4 04/02 (ESC 02/08 04/02) 
G1D6 06/02 (ESC 02/13 06/02) 


NOTE – The corresponding escape sequences are shown in parentheses. 

7.2 Identification according to ISO/IEC 8824-1 (ASN.1) 

In the terminology of ISO/IEC 8824-1 the character set of this part of ISO/IEC 8859 and the 
corresponding coded representations are distinct, and are known as the “character abstract syntax” and 


the “character transfer syntax” respectively. 


When the identification methods of ISO/IEC 8824-1 are used this part of ISO/IEC 8859 shall be 
identified by the following object identifiers: 


e character set 
{ iso standard 8859 16 abstract-syntax (1) } 


% coded representations 
{ iso standard 8859 16 transfer-syntax (0) } 


The corresponding object descriptions shall be: 
e character set “ISO 8859 part 16 repertoire” 
% coded representations “ISO 8859 part 16 code” 


7.3 Identification using the ISO International register of coded character sets to be used 
with escape sequences 

According to 7.1 above the character set of this part of ISO/IEC 8859 may be considered to consist of 
the character SPACE, а 94-character GO graphic character set, and а 96-character G1 graphic character 
set. The GO and G1 graphic character sets may be identified by the use of the Registration Numbers 
from the ISO International register of coded character sets to be used with escape sequences. 


When these registration numbers are used this part of ISO/IEC 8859 shall be identifed by the following 
pair of registration numbers: 


e G0 graphic character set ISO-IR 6 


e Gl graphic character set ISO-IR ххх 
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ANNEX A 
(informative) 
Coverage of languages by parts 1 to 16 of ISO/IEC 8859 


A.1 Languages of European origin written in Latin script 


The following parts 1-16 of ISO/IEC 8859 specify coded character sets which comprise various 
different selections of characters based on the Latin alphabet. These sets are identified by the numbers 
1 to 10 as shown. 


ISO/IEC 8859-1 Latin alphabet No. 
ISO/IEC 8859-2 Latin alphabet No. 
ISO/IEC 8859-3 Latin alphabet No. 
ISO/IEC 8859-4 Latin alphabet No. 
ISO/IEC 8859-9 Latin alphabet No. 
ISO/IEC 8859-10 Latin alphabet No. 6 

ISO/IEC 8859-13 Latin alphabet No. 7 (Baltic Rim) 
ISO/IEC 8859-14 Latin alphabet No. 8 (Celtic) 

DIS 8859-15 Latin alphabet No. 9 

CD 8859-16 Latin alphabet No. 10 


Оооо 


1 
2 
3 
4 
5 


The following official and regional languages written in Europe are known to be covered by the Latin 
alphabets 1-10 as indicated by number in table A.1: 


Table A.1 — Language coverage 


Albanian 5 8 9 Irish Gaelic 
Basque | 5 8 9 (old ortography) 
Breton 1 5 8 9 Italian 1 3 5 8 9 
Catalan 1 5 8 9 Latin 123 4 5 6 7 8 9 10 
Comish 1 5 8 Latvian 4 7 
Croatian 2 10 Lithuanian 4 6 7 
Стесћ 2 Luxemburgish 1 5 8 9 
Danish 1 4 5 6 8 9 Maltese 3 
Dutch 1 5 9 Manx Gaelic 1 5 8 
English 12 3 4 5 6 7 8 9 10 Norwegian 1 456789 
Esperanto 3 Polish 2 10 
Estonian 4 6 7 9 Portuguese 1 3 5 8 9 
Faroese 1 6 9 Rhaeto-Romanic 1 5 8 9 
Finnish 1 45 6 9 Romanian (2) 10 
French D (3) (5) (8)9 10 Sami 4 6 
Frisian 1 5 9 Scottish Gaelic 1 5 8 9 
Galician 1 5 8 9 Slovak 2 
German 123 4 5 6 8 9 10 Slovenian 2 4 6 10 
Greenlandic 1 4 5 6 8 9 Sorbian 2 
Hungarian 2 10 Spanish 1 5 8 9 
Icelandic 1 6 9 Swedish 1 4 5 6 8 9 
Trish Gaelic 1 5 6 8 9 Turkish 5) 5 

(new ortography) Welsh 8 


NOTES 

1 The list of languages in table A.1 is not exhaustive. It shows the languages that are included in the Scope clause 
of each Part of ISO/IEC 8859. 

2. Use of Latin alphabet No. 2 for Romanian is deprecated. 


3. Use of Latin alphabet No. 3 for Turkish is deprecated. 
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ANNEX B 
(informative) 


A.2 Languages written in non-Latin scripts 


The following parts of ISO/IEC 8859 specify coded character sets which include characters from 
alphabets other than the Latin alphabet: 


ISO/IEC 8859-5 Latin/Cyrillic alphabet 
ISO/IEC 8859-6 Latin/Arabic alphabet 
ISO/IEC 8859-7 Latin/Greek alphabet 
ISO/IEC 8859-8 Latin/Hebrew alphabet 
DIS 8859-11 Latin/Thai alphabet 


The following official and regional languages are covered by these alphabets: 

The Cyrillic characters included in part 5 cover Bulgarian, Byelorussian, (Slavic) Macedonian, 
Russian, Serbian, and Ukrainian (as written up to 1990, see also Scope of part 5). 

The Arabic characters included in part 6 cover Arabic. The Greek characters included in part 7 cover 


Greek ( monotoniko orthography). The Hebrew characters included in part 8 cover Hebrew. The Thai 
characters included in part 11 cover Thai. 


Bibliography 
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